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METHOD OF TRANSPOSON-MEDIATED MUTAGENESIS EST THE 
NEMATODE CAENORHABDITIS ELEGANS 

1. FIELD OF THE INVENTION 

The present invention relates to methods for generating and identifying mutations 
in the genome of the nematode Caenorhabditis elegans (hereinafter "C elegaits''). More 
specifically, the present invention relates to a tiansgene construct for expression in C. 
elegans, and to methods for regulating mobiUzation of heterologous or endogenous 
transposons in the C elegans genome, inserting a heterologous DNA sequence into C. 
elegans germline DNA, and engineering mutations into the C elegans genome. 

2. TECHNICAL BACKGROUND 

The use of model genetic systems had its beginnings in the earliest days of the 
science of genetics and, as a result of the tremendous value of such systems in 
understanding genetic phenomena, continues in the present. Researchers often use in their 
work organisms which have short life spans, limited space requirements, and relatively 
small genomes. Specifically, certain species of worms, fiiiit flies, and yeast cells are 
common subjects of research. Using such organisms, researchers may learn the fimction 
of the various genes found within the DNA of the organisms. One commonly used 
method is to generate mutations in the genome of an organism, followed by selection or 
screening for those mutations which confer a specific property or characteristic to the 
organism. These mutational studies suggest probable functions for the genes in which 
mutations occur. Mutations often occur when a gene is changed in such a way that the 
product of the gene is altered or nonfimctional. 

A common method for generating mutations uses transposable elements. 
Transposable elements are segments of DNA which have the ability to 'liop' — that is, to 
be excised from their initial position in the DNA and move to a new location. In doing 
this, a transposable element, also known as a transposon, may insert into some portion of 
a gene, thus disrupting or even changing the fimction of the gene. Further, additional 
mutations may be created by remobilizuig the transposon. Since this remobilization often 
occurs imperfectly, changes are created in the DNA sequence, leaving the final sequence 
different from the original sequence. See J.D, Watson, J. Witkowski, M, Gihnan, and M. 
Zoller, Recombinant DNA 175-190, 439-440 2d. ed. (1996). 
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The P element, a transposable element fomid in the genes of fimit flies, see, e,g,, 
A. C. Spradling, G. M. Rubin, Science 218, 341 (1982); J.D. Watson, J. Witkowski, M. 
Gihnan, andM. ZoUer, Recombinant DNA 175, 177 2d. ed. (1996). has been an 
enormously useful tool in Drosophila genetic analysis for two reasons. First, these 
transposons have been used for insertional mutagenesis. Mutagenic insertions constitute 
molecular tags that are used to rapidly clone the mutated gene. L. Cooley, R. Kelley, A, 
Spradling, Science 239, 1 121 (1988). Particularly helpful in such studies is the presence 
of strains that lack any copies of the transposon. Second, P elements are used to introduce 
single copies of foreign sequences into the host genome. This feature is particularly useful 
for the rapid identification of gene expression patterns by using enhancer traps. H. J. 
Bellen, et al.. Genes Dev. 3, 1288 (1989). The availability of such techniques would be 
particularly advantageous in studies of the genome of the nematode C elegaiis. 

C elegaiis is a model system in which genetics can be used to identify genes and 
biological pathways which are conserved between nematodes and vertebrates, and which 
thus constitute potential targets for the treatment of various diseases. C elegans is 
particularly advantageoiis for genetic studies because it is easily propagated and because 
the genetic and physical maps of its genome are well-characterized. W. B. Wood, 
Introduction to C elegans Biology (1988). The characterization of gene stmcture in 
C. elegans has become routine, largely through the efiforts of the C elegans genome 
project. The workers involved in this effort have cloned the entire genome into cosmid or 
YAC vectors and have completed the genomic sequence. C. elegans Sequencing 
Consortium, Science 282:2012-2018 (1998); A. Coulson et al., Proc Natl Acad Sci USA 
83:7821-7825 (1986); A. Coulson et al., Bioessays 13:413-417 (1991); R. Wilson et al., 
Nature 368:32-38 (1994). 

Standard mutagenesis m C. elegans employs chemical mutagens. After generation 
of a mutant, identification of the gene requires time-consuming genetic mapping followed 
by single gene rescue. Alternatively, transposon-based mutagenesis has been attempted 
using mutant backgrounds like mut'2, but efficiency of transposition is low and not 
specific for a defined transposon class. Further, since the genomes of all C. elegans 
strains contain transposons, it is very difficult to identify relevant insertions. Thus, utility 
of native transposons for regulated transposition in C. elegans is limited. First, all strains 
contain multiple copies of these transposons and thus new insertions do not provide 



wo 00/073510 



PCTAJSOO/40091 



unique tags. Second, mutator strains tend to activate the transposition of several classes of 
transposons, so that the type of transposon associated with a particular mutation is not 
known. Third, transposition is not regulated and the transposon tag can be lost by excision 
in subsequent generations. Fourth, attempts to regulate transposase expression have failed 
because expression of transgenes in the gennline of C elegans is very difficult. Although 
5 one could theoretically regulate the transposition of a specific element by expressing the 
transposase under the control of a gennline-specific promoter, transgenic arrays are 
typically silenced in the gennline. W. G, Kelly, S. Xu, M. K. Montgomery, A. Fire, 
Genetics 146, 227 (1997) 

Another problem in this field is the difficulty of expressing DNA in the C elegans 

10 germline. Current methods, see, e.g., W. G, Kelly et al, Genetics 146:227-238 (1997), 
are not adequate, First, current methods for expressing foreign DNA in the C elegans 
gennline do not work for all genes. Second, expression of genes introduced using these 
methods dechnes over time. 

Finally, introduction of single copy DNA is not possible using existing 

15 technology. 

From the foregoing, it will be appreciated that it would be a significant 
advancement in the art to provide methods that allow reg^llated expression of foreign 
, DNA in the C. elegans gennUne. It woidd be a fiirther advancement to provide methods 
that allow gennline expression of a transgene in C elegans. It would be a fiirther 

20 advancement in the art to provide regulated expression of such a transgene in the 
germline, as by regulation using a heat-shock promoter. It would be a fiirther 
advancement to provide methods of regulating the transposition of either endogenous or 
heterologous transposons in C elegans. Further, it would be an advancement to provide 
transgene constructs to facilitate germline expression of transgenes and regulated 

25 transposition of homologous and heterologous transposons; Such compositions of matter 
and methods are disclosed herein. 

3. BRIEF SUMMARY OF THE INVENTION 

The present invention relates to improved methods for generating and identifying 
30 mutations in C. elegajis^ and includes methods for introducing heterologous DNA into the 
C. elegans germline and causing its expression. In certain embodiments, a method of the 
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present inveation comprises the steps of inserting a transgene construct into the C. 
elegans, wherein the construct comprises a heterologous gene operably linked to a 
promoter and a 3 ' untranslated region of a gene that is expressed in the C. elegans 
germline; and expressing the heterologous gene. In certain embodiments, this method 
further comprises the removal of all bacterial plasmid sequences and repeated sequences 
5 from the DNA to be introduced. Li certain preferred embodiments, a promoter that is 

active in the C elegans germline drives expression of the transgene. In certain especially 
preferred embodiments, the promoter is an mducible promoter. 

The present invention further relates to a transgene construct for expression in C. 
elegajis which comprises a heterologous gene operably linked to a promoter and a 3* 
10 untranslated region of a gene expressed in the C elegans genome. In certain 

embodiments, the transgene construct fiirther comprises a promoter that is active in the 
germline of C elegans or a promoter that is inducible. 

The present invention further relates to methods for generating and identifying 
mutations in C elegans. In one embodiment, a method of the present invention comprises 
15 the introduction and expression of a transposase gene to mobilize either endogenous or 
heterologous transposons. In certain preferred embodiments, the transposons are 
endogenous Tc3 transposons. 

In certain other embodiments, the transposons are heterologous transposons, such 
as the Drosophila mariner element. Controlled mobilization of heterologous transposons 
20 allows the generation of mutations, which are tagged by the insertion of the transposon. 
PCR-based techniques permit rapid identification of the transposon insertion that caused 
the mutation. 

The present invention further relates to methods for introducing single copy DNA 
sequences into C elegans. In certain preferred embodiments, a method of the present 

25 invention comprises introducing a transposon comprising a heterologous DNA sequence 
into a C. elegans, introducing a transgene construct comprising a transposase gene 
operably linked to a promoter and a 3' imtranslated region of a gene that is expressed in 
the C elegans germline, and expressing the transposase such that the transposase 
integrates into a C. elegans chromosome as a single copy. The transposon may be 

30 engineered to introduce a DNA sequence, such as one that codes for a reporter gene such 
as, for example, a green fluorescent protein. The introduced DNA sequence may also 
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contain FRT/FLP or CRE/LOX recombination sites. Alternatively, the introduced DNA 
sequence may contain polyadenylation sites or transcriptional terminators. 

These and other features and advantages of the present invention will become 
more fully apparent from the following detailed description. 

5 4. SUMMARY OF THE DRAWINGS 

Figure 1 schematically depicts a method for mutagenesis by controlled 
heterologous transposition. 

Figure 2 depicts the structure of the pJL44 Mosl transposase expression vector. 

Figure 3 schematically depicts a method for identifying sequences flanking the 
10 Mos 1 insertion site using inverse PCR. 

Figure 4 depicts the sequence (SEQ ID NO.: 23) of an inverse PCR product. 
Nucleotides in capital letters are from HhtMosl transposon. The C. elegmxs flanking 
genomic region is in lower case. It matches the Y47C4,A sequence from chromosome X 
available at the Sanger Centre. See the Sanger Centre web site at 
15 http://www,sanger.ac.uk/Projects/C_eIegans/ . 

Figure 5 depicts the mobilization of Mosl in C. elegans somatic cells. (A) 
Engineering of the Mos transposase encoding sequence. Restriction sites were generated 
at the 5* and 3' ends of the coding sequence (new sequence is indicated under the original 
sequence). The endogenous polyadenylation signal (boxed) was disrupted and an 
20 artificial intron was introduced in the coding sequence in order to improve transposase 

expression. See A. Fire, S. W. Harrison, D. Dixon, Gene 93, 189 (1990). (B) Localization 
of Mosl insertions in xmC'49 and gpa'2 genes after induction of Mos transposase 
expression in somatic cells. Open triangles: insertion sites; black rectangles: coding 
exons; white rectangle: non coding exonic sequence. Arrows: genomic primers used to 
25 amplify the insertions. (C) Sequence comparison of 22 insertion sites. Insertion sites are 
oriented relative to the 5' end of the Mos/ transposon. Sequences that flank Mosl at the 
right end were identified by PCR. DNA purification and PCR were perfomxed as 
described m H. G. van Luenen, S. D. Colloms, R. H. Plasterk, EinboX 12, 2513 (1993). 
The primers in Mosl were 0JL88 (S'-CGCATGCGGCTTACTCAC (SEQ ID NO: 4)) 
30 first PCR; and oJL89 (5'-GGCCCCATCCGATTACCACCTA (SEQ ID NO: 5)) second 
PCR. Primers in unc'49 were oJL19 (5'-GCGAAACGCATACCAACTGTA (SEQ ID 
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NO: 6)) first PGR; and oJL20 (5'-TTCATGCCGAAAAGCAGGCGT (SEQ ID NO: 7)) 
second PGR. Primers in gpa'2 were the same as described in H. G. van Luenen, S. D. 
CoUoms, R. H. Plasterk, Embo J. 12, 2513 (1993). PGR products were gel-purified and 
sequenced using oJL89 (SEQ ID NO: 5) as a primer, (positive positions on the graph), 
sequences that flank the left end of Mosl were deduced firom unC'49 and gpa-2 sequences 
5 (negative positions on the graph). 

Figure 6 depicts germline mobilization of Mosl. (A) Mos transposase was 
expressed firom an extrachromosomal array using either a glh-2 or a heat-shock promoter. 
The Mosl transposon was contained in an array integrated on chromosome V 
(oxIs25[Mosl;rol-6(sd]). The array containing chromosome was balanced by the dpy- 

10 1 l(e224) mutation. In the next generation, catastrophic excision of the transgene was 
observed (indicated as Aoxis) among the progeny. (B) Comparison of excision and 
insertion firequencies using glh'2 and heat-shock (hsp) promoters to drive M95 
transposase expression in the germline. New Mosl insertions were identified by PGR. 
Specifically, the presence of Mosl was detected through PGR by using two primers 

15 located in the transposon, oJL102 (SEQ ID NO: 1) and oJL103 (SEQ ID NO: 3). The 

absence of D. mauritiana flanking sequence was checked using oJL102 (SEQ ID NO: 1) 
and 0JLIO4 (SEQ ID NO: 2) as described below. In addition, a PGR positive control was 
performed on each DNA sample usiug oligonucleotides located in the cha-l gene. 
Recombination events were recognized as Dpy worms also containing Mosl flanked by 

20 original Drosophila genomic sequences. 

Figure 7 shows Mosl genomic insertions. (A) Southern blot probed with labelled 
Mosl DNA. Lane 1 to 8, strains in which insertions were detected by PGR; insertions 
derived fi*om an extrachromosomal array and Mos transposase expressed under the heat- 
shock promoter. Mosl presence was assessed by PGR using two primers located in the 

25 transposon (oJL102 (SEQ ID NO: 1) and oJL103 (SEQ ID NO: 3)). The absence of £>. 
mauritiana flanking sequence was checked using oJL102 (SEQ ID NO: 1) and oJL104 
(SEQ ID NO: 2), while a PGR positive control was performed on each DNA sample using 
oligonucleotides located in the cha~l gene. The control lane is lin-15(n765) which had 
been used to build transgenic lines. Each lane contaios 2 mg of Bgl II digested genomic 

30 DNA. Th^Mosl probe (encompassing bases I to 174 of the transposon) was synthesized 
by PGR using the pBluescribeMlS+ZMoslplasmid as a template. (B) Distribution of 
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Mosl inserts on the physical map of the C. elegans genome. Black triangles: insertions 
from aa extrachromosomal array. Open triangles: insertions from the integrated array 
oxIs25. Open circle: position of oxIs25, the integrated array of Mosl transposons. (C) 
DNA sequence of Mosl de novo insertions. Genomic fragments that flank the transposon 
left end were isolated by inverse PGR and sequenced. A primer was designed in the 
5 genomic region to the right of the insert and used with a Mosl specific primer to amplify 
and sequence the right end flanking the fragment. At insertion sites TA dinucleotides 
(bold) were duphcated during the process of transposon integration. Lower case: Mosl 
sequence. Upper case: genomic sequence. 

Figure 8 shows a knock-in strategy wherein Mosl excision causes a DNA double 
10 strand break, after which a transgene containing sequences homologous to the excision 
region pairs with the chromosome. Finally, the mutation contained in the transgene is 
copied into the chromosome. 

These drawings only provide information concerning typical embodiments of the 
invention and are not therefore to be considered limiting of its scope, 

15 

5. DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to novel methods for generating and identifying 
mutations in C. elegans, and includes methods for introducing a transgene into the C. 
elegans germline. The present invention also includes methods for expressing transgenes 

20 in the C elegans germline. The present invention also includes a transgene construct for 
expression in C. elegans and methods for generating mutations by regulating the 
transposition of endogenous or heterologous transposons in C. elegans. The present 
invention also includes methods for inserting single copy DNA into a C elegans genome 
by introducing a transgene comprising FLP Recombination Target (FRT) sites into a C. 

25 elegans and causing recombination. FLP is a site-specific recombinase which efficiently 
catalyzes recombination between FRT sites that have been placed in the genome. K, G. 
Golic, S. L. Lindquist, Cell 44, 521 (1986). When FRT sites are in the same relative 
orientation within a chromosome, the FLP recombinase excises the intervening DNA 
from the chromosome. See Golic, Kent G., Generating Mosaics By Site-Specific 

30 Recombination, In Cellular Interactions In Development: A Practical Approach, 1-3 1 (D, 
A. Hartley, ed., Oxford Univ. Press 1993); and Plasterk R.H., Groenen J.T., Targeted 
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Alterations of the Caenorhabditis elegans Genome by Transgene Instructed DNA Double 
Strand Break Repair Following Tel Excision, EMBO 7., 11:287-90 (1992). Other 
recombination systenas, such as CRE/LOX, could also be used. 

All publications, patents, and patent applications cited herein are hereby 
incorporated by reference. U.S. Patent Application Serial No, 60/136,972 is hereby 
5 incorporated by reference in its entirety. 

DEFINITIONS 

The term "heterologous" is used herein to include nucleic acid sequences such as 
coding sequences and control sequences that are not normally joined together, and/or are 
not normally associated with a particular cell. Thus, a heterologous region of a construct 

10 or vector is a segment of nucleic acid within or attached to another nucleic acid molecule 
that is not foxmd in association with this other molecule in nature. For example, a 
heterologous region of a nucleic acid construct could include a coding sequence flanked 
by sequences not found in association with the coding sequence in nature (e.g., synthetic 
sequences having codons different from the native gene). Similarly, a cell transformed 

15 with a construct which is not normally present in the cell woxild be considered 

heterologous for purposes of this invention. The term includes, but is not limited to, a 
DNA sequence from another organism. 

The term '*transgene" is a heterologous sequence that is introduced into an 
organism. The term includes both sequences that integrate into one or more chromosomal 

20 locations of the organism and sequences that are maintained extrachromosomally, e.g., as 
episomes. 

The term "regulable expression control element" includes promoters, 
polyadenylation signals, transcription termination sequences, upstream regulatory 
domains, origins of replication, internal ribosome entry sites, enhancers, and the like, 

25 which provide for the replication, transcription, and translation of a coding sequence in a 
recipient cell or in a cell of an organism. The term promoter refers to a DNA sequence, 
that is capable of binding RNA polymerase and initiating transcription of a downstream 
(3* direction) sequence. Inducible promoters are promoters which are regulable. Such 
promoters may be regulated by, for example, temperature, small molecules, or 

30 developmental stages of an organism. 

Inducible promoters include heat-shock promoters, which are induced by exposiure 
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to heat. Inducible also promoters include small raolecule-regulated promoters. Other 
inducible promoters include promoters that are induced (or repressed) by tetracycline and 
its derivatives (Gossen & Bujard, Proc, Natl. Acad, ScL USA 89:5547-5551 (1992)). 

"Operably linked" refers to an arrangement of elements in which the components 
so described are configured so as to perform their usual function. Thus, control sequences 
5 such as regulable expression control elements operably linked to a coding sequence are 
capable of affecting the expression of the coding sequence. The control sequences need 
not be contiguous with the coding sequence so long as they function to direct the 
expression thereof. Thus, for example, intervening untranslated yet transcribed sequences 
can be present between a promoter sequence and the coding sequence and the promoter 

10 sequence can still be considered "operably linked" to the coding sequence. 

For the purpose of describing the relative position of nucleotide sequences in a 
particular nucleic acid molecule throughout the instant application, such as when a 
particular nucleotide sequence is described as being situated "upstream," "downstream," 
"3'," or "5'," relative to another sequence, it is to be understood that it is the position of 

1 5 the sequences in the "sense" or "coding" strand of a DNA molecule that is being referred 
to as is c'onventional in the art. 
^ GENERAL METHODS 

In one embodiment, a method of regulated expression of a heterologous gene in 
cells of the germline of C elegans comprises the steps of inserting a transgene construct 

20 into the C. elegans, wherein the construct comprises the heterologous gene operably 
linked to a promoter and a 3 ' untranslated region of a gene that is expressed in the C. 
elegans germline; and expressing the heterologous gene. Other embodiments further 
comprise the use of a promoter which is inducible, such as a heat-shock promoter or a 
tetracycline-regulated promoter. Yet other preferred embodiments comprise the removal 

25 of substantially all bacterial plasmid sequences and repeated sequences from the 

transgene. In certain other especially preferred embodiments, the method of the present 
invention comprises the addition of the 3 ' untranslated region (UTR) of the glh'2 gene, 
which is expressed in the C elegans germline, to the 3' end of the transgene. A promoter, 
such as aglh'2 promoter, which is a germline specific promoter, may be used to drive 

30 expression of the transgene. 

The present invention further comprises transgene constructs for expression in C 



9 



wo 00/073510 



PCT/USOO/40091 



elegans which comprises a heterologous gene operably linked to a promoter and a 3 ' 
untranslated region of a gene that is expressed in the C. elegans germline. In certain 
embodiments, the promoter is active in the cells of the gennline of C elegaiis. In other 
embodiments, the promoter comprises an inducible promoter such as a heat-shock 
promoter or a tetracycline-regulated promoter expressed in the gennline. Yet other 
5 embodiments comprise the removal of substantially all bacterial plasmid sequences and 
repeated sequences jfrom the transgene. In certain preferred embodiments, the method of 
the present invention comprises the addition of a 3' untranslated region (UTR) oi^glh-2 
gene, which is expressed in the C elegans germline, to the 3* end of the transgene. A 
promoter, such as a glh-2 promoter, may be used to drive expression of the transgene. In 

1 0 still other embodiments, the heterologous gene codes for a transposase. In certain 
preferred embodiments, the heterologous gene is a TC3 A transposase g^e. 

The present invention also includes methods for generating mutations in the 
genome of a C elegans by using controlled mobilization of transposons. In certain 
embodiments, the method for generating mutations in the genome of a C. elegans 

15 comprises the steps of introducing a transgene construct comprising a transposase gene 
which is operably linked to a regulable expression control element and a 3' untranslated 
region of a gene that is expressed in the C elegans germline into a C. elegans; and 
expressing the transposase gene. Such methods allow the generation of mutants in which 
the mutated genes are tagged by the insertion of the transposon, PCR-based techniques 

20 permit fast identification of the transposon insertion that causes the mutation. Since the C. 
elegans genome has been entirely sequenced, sequencing of the genomic regions that 
flank the transposon allows immediate identification of the mutated gene. 

In certain embodiments, transposons used in the method of the present invention 
are endogenous transposons. Several diiferent types of endogenous transposons are 

25 present in C elegans, and these can be mobilized in mutator strains. See, e.g., R. H. A. 
Plasterk, H. G. A. M. van Luenen, in C elegans II, D. L. Riddle, T. Blumenthal, B. J. 
Meyer, J. R. Priess, Eds. (Cold Spring Harbor Laboratory Press, New York, 1997) pp. 97- 
116. Mutator alleles have been useful in cloning C. elegans genes, particularly in early 
studies before the genome project reagents were widely available. In certain preferred 

30 embodiments, the endogenous transposons are Tc3 transposons. In yet other 
embodiments, the transposase gene is a TC3 A transposase gene. In still other 
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embodiments, the regulable e^qjression control element is an inducible promoter, 
comprising in some embodiments a heat-shock promoter or a tetracycline-regulated 
promoter. Yet other embodiments comprise the removal of substantially all bacterial 
plasmid sequences and repeated sequences from the transgene construct In certain 
preferred embodiments, the method of the present invention comprises the addition of the 
5 3' untranslated region (UTR) of the glh-2 gene, which is expressed in the C. elegans 
germline, to the 3' end of the transgene. In such embodiments, the regulable expression 
control element comprises a promoter, such as a glh'2 promoter or a heat-shock promoter, 
which may be used to drive expression of the transgene. 

In other embodiments, mutants may be generated by xxsing controlled mobilization 

10 of heterologous transposons. Using a heterologous transposon allows researchers to tag 
mutated genes with a sequence that is unique in C. elegans genome. These tagged 
mutations will allow the rapid cloning of the mutated genes. The primary advantage over 
the endogenous transposon scheme is that this method avoids the isolation of irrelevant 
insertions of the endogenous C elegans transposons. A further advantage is that the 

15 expression of the heterologous transposase would only mobilize the heterologous 
element, and thus mutations should only be due to insertions of these elements. 
Additionally, insertions could be stabilized by loss of the trar^osase-expressing 
construct. 

In certain preferred embodiments, Mosl, a marmer-like transposon isolated from 
20 Drosophila niauritiana, is used. M. Medhora et al., Genetics 128:31 1-318 (1991). See 
generally D. L. Hartl et al, Annu. Re\K Genet 31:337-358 (1997). Mosl is a member of 
the mariner/Tcl family and was initially identified in the fruitfly Drosophila mauriiiana, 
J. W. Jacobson, M. M. Medhora, D. L. Hartl, Proc. Nail, Acad. ScL USA 83, 8684 
(1986). Like the other members of the mariner/Tcl family, Mosl contains a single open 
25 reading frame which encodes the transposase. The transposase binds to and cleaves at the 
inverted terminal repeats (ITRs) present at each end of the transposon. See, e.g, D. L. 
Hartl, A, R. Lohe, E. R. Lozovskaya, Annu. Rev, Genet 31, 337 (1997); R. H. Plasterk, Z. 
Izsvak, Z. Ivies, Trends Genet 15, 326 (1999). The Mosl transposase is the only protein 
necessary for transposition in vitro, L. R. Tosi, S. M. Beverley, Nucleic Acids Res, 28, 
30 784 (2000). Because no additional factors are required for transposition, the Mosl 

transposon should be capable of transposition in heterologous species, and indeed the 
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transposon has been mobilized in species evolutionarily distant from Drosophila, F. J. 
Gueiros-Filho, S. M. Beverley, Scimce 276, 1716 (1997); J. M. Fadool, D. L. Hartl, J. E. 
Bowling, Proc. Natl Acad. Set USA 95, 5182 (1998); A. Sherman, et al., Nat 
Bioiechnol 16, 1050 (1998); C. J. Coates, N. Jasinskiene, L. Miyashiio, A. A, James, 
Proc, Natl Acad, Scl USA 95, 3748 (1998). 
5 In other embodiments, the transposase gene comprises restriction sites 5' of the 

start codon, restriction sites 5' of the stop codon, and an artificial intron in the transposase 
gene open reading frame. Other preferred embodunents involve a regulable expression 
control element which comprises an inducible promoter such as a heat-shock promoter or 
a tetracycline-regulated promoter. Yet other preferred embodiments comprise the removal 

10 of substantially all bacterial plasmid DNA sequences and repeated sequences from the 
transgene construct. In certain preferred embodiments, the method of the present 
invention comprises the addition of the 3' untranslated region (UTR) of the glh'2 gene, 
which is expressed in the C. elegans germline, to the 3' end of the transgene. In such 
embodiments, the regulable expression control element may comprise a promoter, such as 

15 a glh'2 promoter, a myo-3 promoter or a heat-shock promoter, which may be used to drive 
expression of the transgene. 

In yet other embodiments, the method of the present invention includes 
engineering the transposon to carry a heterologous DNA sequence into a C. elegans 
chromosome. Certain embodiments of the method of the current invention may comprise 

20 the steps of introducing a transposon into the C. elegans, whereia the transposon 

comprises the heterologous DNA sequence; introducing a transgene construct into the C. 
elegans, wherein the construct comprises a transposase gene which is operably linked to a 
promoter and a 3' untranslated region of a gene that is expressed in the C. elegans 
gennliae; and expressing the transposase, such that the transposon integrates as a single 

25 copy into a C, elegans chromosome. 

In some embodiments, the transposon may be modified to contain bacterial 
plasmid DNA sequences. Such sequences may simplify cloning of mutated genes into 
bacteria from C. elegans genomic DNA preparations. In yet other embodiments, ttie 
transposon may carry a gene usefiil for selection or screening pmposes. 

30 In certain preferred embodiments, FRT/FLP or CRE/LOX recombination sites 

could be inserted into the transposon. One of skill in the art would appreciate that an 
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engineered transposon cairying such recombination sites would facilitate insertion of 
single copy DNA into the C elegans genome. In other embodiments, the transposon 
could include polyadenylation sites or transcriptional terminators. 

In yet other preferred embodiments, the promoter is inducible. In such 
embodiments, inducible promoters such as a heat-shock promoter, may be used. Yet other 
preferred embodiments comprise the removal of substantially all bacterial plasmid DNA 
sequences and repeated sequences from the transgene construct. In certain preferred 
embodiments, the method of the present invention comprises the addition of the 3' 
untranslated region (UTR) of the glh-I gene, which is expressed in the C elegans 
germline, to the 3* end of the transgene. In such embodiments, the regulable expression 
control element may comprise a promoter, such as zglh-2 promoter, a myo-3 promoter or 
a heat-shock promoter, which may be used to drive expression of the transgene. 

6. EXAMPLES 

The following examples are given to illustrate several embodiments which have 
been made within the scope of the present invention. It is to be understood that these 
examples are neither comprehensive nor exhaustive of the many types of embodiments 
which can be prepared in accordance with the present invention. 

Example 1 - Mobilization of Endogenous Tc3 Transposons 

About 15 copies of the Tc3 transposon are present in the genome of the wild-type 
C elegans N2 strain. These transposable elements are inactive in wild-type animals. Our 
goal is to cause specific mobilization of the endogenous Tc3 copies by expressing the 
TC3A transposase in the germline. New Tc3 insertions will be used as tags to clone the 
genes which they have disrupted. 

1- TC3A expressed under the ced'9 promoter causes somatic and germline hops: 

The TC3 A transposase gene has been cloned behind a ced-9 promoter. This 
construct has been coinjected with linearized C elegans genomic DNA and the lin-15(+) 
plasmid into a lin-15(-) strain and unstable transgenic strains have been obtained. 
Transposase activity was assayed by testing whether the construct could excise a Tc3 
element from the unc-22 gene and restore the function of the locus. The ced-9: :Tc3A 
arrays have been crossed into unc'22(r750::Tc3): lin'15(n765ts) background. Wild-type 
revertants have been recovered from Unc-22 Fl animals, suggesting functional expression 
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of the TC3A transposase. One of these extrachromosomal arrays was integrated into a 
chromosomal location to generate the insertion oxIsl7[ced-9::Tc3A: lin'15(-^)], 

oxIsJ 7 was mapped on the X chromosome and functionally characterized, 
lin'13(n765ts);oxTsl7[ced-9::Tc3A; lin-15(^)] males were crossed to unc-22(r750::Tc3): 
lin-15(n765ts) hermaphrodites. Heterozygous nonUnc nonLin hermaphrodites were 
5 cloned and allowed to self-fertilize. It was expected that among the progeny of these 
animals, there would be found 1/4 Unc animals homozygous for unc-22(r750::Tc3). Of 
those Unc individuals, 3/4 should he either homozygous or heterozygous for oxlsl 7 i.e. 
nonLia It was observed, however, that tiie Unc nonLin animals were greatly under- 
represented — ^instead, there were many more nonUnc individuals. Since the ced-9 gene is 

10 ubiquitously expressed, it was reasoned that TC3A could be present not only in the 
germline but also in somatic cells and could cause somatic reversion of the Unc 
phenotype. To test this hypothesis, nonUnc nonLin individuals were cloned, assmrdng 
that a fraction of them could be homozygous for unC'22(r750::Tc3) despite their wild- 
type phenotype. Self-progeny of these animals were scored. Individuals heterozygous for 

1 5 oxlsl 7 segregated 1/4 Lin animals (which no longer expressed the TC3 A transposase). In 
this category, plates were identified in which 100 % of the Lin worms were Unc while 
almost 100% of the nonLin were nonUnc. Hence, the parent hermaphrodite must have 
been homozygous unC'22(r750::Tc3) mutant although its phenotype was wild-type. 
These data demonstrate that ced-9::Tc3A causes somatic reversion of the unc- 

20 22(r750: :Tc3) locus at high frequency. 

Rare nonUnc Lin were looked for among the Lin animals generated by self- 
fertilization of unc-22(r7 50: :Tc3); lin-lSf-); oxlsl 7/+ hermaphrodites to determine 
germline reversion rates. Since the Lin worms had lost oxlsl 7 and had no TC3 A 
transposase expressed somatically during development, the only way to revert the Unc 

25 phenotype was to receive one reverted copy of the unc~22 locus. This reversion event had 
to occur during germline development. Rare nonUnc Lin progeny were identified among 
Unc Lin progeny (experiment #1 ; 1 nonUnc Lin m 61 Lin total; exp. #2: 2/106; exp. #3: 
4/203). It was concluded that ced-9::Tc3A causes an approximately 2% reversion rate of 
the unc-22(r750::Tc3) locus in the germline. 

30 2- Expression of TC3A using a glh'2 promoter: 

Since the somatic reversion caused by ced'9::Tc3A causes a discrepancy between 
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the phenotype and the genotype of an individual carrying a locus disrupted by a Tc3 
insertion, a TC3A expression vector was designed based on the germline specific glh-2 
gene (gift of Karen Bennett), A plasmid containing a glh-Z genomic fragment is able to 
rescue the Glh-2 mutant phenotype and is therefore likely to be expressed in the germline. 
The glh-2 open reading frame was deleted and replaced by a multiple cloning site to 
generate an expression cassette that retains glh-l promoter and 3' untranslated regions, 
Tc3A was inserted to generate ^//t-2;.TcJ^. This construct has been coinjected with 
linearized C elegans genomic DNA and the Iin'15(+) plasmid into a lin-15(') strain and 
several unstable transgenic strains have been obtained, A plasmid driving strong 
expression of the Green Fluorescent Protein (herebiafter "GFP") in the coelomocytes (gift 
of Piah Sengupta) has been incorporated also in this array. This allows monitoring of the 
presence of the array in a lin-15(^) background based on GFP expression. 

As described above, these arrays have been crossed into unc-22(r750::Tc3); 
lin~15(n765ts) background. In contrast to the ced-9::Tc3A experiments, no somatic 
reversion events were observed. Germline reversion events were observed in the progeny 
of unC'22; oxEx[glh'2::Tc3A] hermaphrodites (experiment #1: 2/1914 total scored 
animals; experiment #2: 5/4312). It was concluded that glh-2::Tc3A causes a 0.1% 
reversion rate of the unc-22(r750::Tc3) locus ia the germline. 

Example 2 - Expression of the Mariner Transposase in C elegans 
A mutagenesis strategy was also developed that uses the mariner transposon from 
the homfly (gift of David Lampe and Hugh Robertson). Mariner transposons from 
Drosophila are related to C. elegaris Tc transposons. In fact, members of the Tc/mariner 
family of fransposable elements have been identified in a broad range of species. R. H. A. 
Plasterk & H. G. A M. van Luenen, Traitsposoits, in C. elegans II 97-1 16 (D,L. Riddle et 
al. eds., 1997). Horizontal transfer may be responsible for the broad distribution of this 
family of transposable elements. Horizontal transfer implies that specific host factors are 
not required for transposition and biochemical characterization has borne this supposition 
out. Purified transposase is able to catalyze the transposition of mariner or of Tc elements 
from a host plasmid to a target plasmid. D. J. Lampe et al., EMBO J 15:5470-5479 
(1996); J. C. Vos et al.. Genes Dev 10:755-761 (1996). This has enabled researchers to 
mobilize mariner elements from Drosophila in other Dipteran species, T. G. Loukeris et 
al., Proc Natl Acad Sci USA 92:9485-9489 (1995); T. G. Loukeris et al., Science 
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270:2002-2005 (1995); A, R. Lohe & D. L. Hartl, Genetics 143:365-374 (1996). 
Recently, a mariner element from Drosophila has been mobilized in Leishmanial which 
represents a trans-kingdom transposition. F, J. Gueiros-FiUio & S. M. Beverly, Science 
276:1716-1719 (1997). Thus, it was possible that mariner would be active in C elegants 
as well. 

5 A plasmid encoding the mariner transposase BDMARl was received from David 

Lampe and Hugh Robertson. First, the transposase coding sequence was engineered to 
allow for efficient expression in C. elegans. Restriction sites were inserted immediately 
upstream to the start codon and just before the stop codon to facilitate subcloning of the 
fragment in various expression vectors. An artificial intron was inserted in the open 
10 reading frame since the presence of introns improves the expression level of transgenes in 
C. elegans. 

Engineered Himarl was placed under the control of the muscle specific promoter 
myo'3. The myo-3::Himarl constmct was injected with the lin~15(-^) plasmid into a 
lin'15(-) strain and unstable transgenic strains obtained. Expression of the HIMARl 

15 transposase was examined first by Westrai Blot. Extracts were prepared from oxExfmyo- 
3::Himarl: lin-15('^)J worms, run on a denaturing acrylamide gel and transferred to a 
nitrocellulose membrane. The membrane was probed with previously characterized 
antibodies that recognize the HIMARl protein (provided by David Lampe and Hugh 
Robertson). In extracts of transgenic worms, an approximately 42 kD protein which 

20 corresponds to the expected molecular weight of HIMARl was detected. The signal was 
absent from non-transgenic worm extracts. Using the same antibodies, the protein was 
visualized in situ using immunofluorescence on oxExfmyo*3::Hintarl; lin-15(+)] worms. 
Intense immunoreactivity was detected which was restricted to the nucleivof mxiscle cells. 
These data indicate that the HIMARl mariner transposase is expressed and properly 

25 targeted to nuclei in C. elegans cells. 

Example 3 - Germliae Expression of the Transposase Using the ^lh*2 Promoter 
Generation of heritable Mosl insertions would require expression of the Mos 
transposase in the germline. However, expression of transgenes in the germline of C 
elegans is not possible using standard techniques. Typically, transgenic worms are 

30 generated by injecting plasmid DNA into the gonads of C elegans (C. C. Mello, J. M. 
Kramer, D. Stinchcomb, V. Ambros, Embo J, 10, 3959 (1991)). These fragments then 
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form a simple array of repeated DNA segments. Although gene expression is robust in 
somatic tissues, such simple arrays are not expressed in the germline or are silenced after 
a few generations. Co-injection of genomic DNA with plasmid DNA improves germline 
expression, presumably by preventing tandem repeats in the array. W, G. Kelly, S, Xu, M. 
K. Montgomery, A. Fire, Genetics 146, 227 (1997). To express iheMos transposase in the 
5 germline, an expression vector containing the promoter and the 3* UTR of the glh-2 gene 
was built. This gene encodes a germline helicase which is specifically expressed in the 
gonad. M. E. Gruidl, et al., Proc. Natl Acad ScL USA 93, 13837 (1996). Transgenic 
lines carrying extrachromosomal arrays of the glh-2::Mos transposase construct were 
generated by microinjection. To maximize expression in the germline, constructs were 

10 isolated JSrom plasmid vector sequences and were coinjected with fragmented genomic 
DNA. (The Mos transposase coding sequence was introduced between the promoter and 
the 3' UTR of Specifically, this constmct (pJL9) contams 2.2 Icb of the glh-2 
genomic sequence immediately upstream of the translation start site (nt 29,882 to 32,095 
in cosmid C55B7), an Mlu I-Nhe I cloning site, and 0.8 kb of sequence immediately 

15 downstream of the glh-2 stop codon. An Mlu I-Nhe I firagment containing the Mos 

transposase was subcloned into pJL9 to generate the glh'2::MosTransposase constmct 
lin'15(n765) hermaphrodites were injected with a Spe I - Kpn I fragment of glh- 
2::MosTransposase (injection concentration 10 ng//il), with lin-15('\-) (EKL15) and ofin- 
i-'SfP (pPD97/98) fragments and N2 worm genomic DNA as described above for the 

20 generation of the oxExl66[hsp::MosTransposase] array.) 

Transposase expression in the germline was determined by assaying for excision 
of transposons from a defined chromosomal location. Specifically, the Moj^i -containing 
extrachromosomal array was integrated into chromosome V to generate oxIs25[Mosl;rol- 
6(sd)]. The oxIs25 array was mapped less than 0.54 map units from dpy-1 1 . 

25 Heterozygoxis oxIsl2/dpy-l 1 worms were generated These animals largely segregated 

Dpy and Rol progeny as expected for these closely linked markers (Figure 6). However, * 
when the glh-2: :Mos Transposase transgene was crossed in, approximately 16% of the 
nonDpy progeny were nonRol (15.7 % ± 0.9, mean ± SEM, n=44 plates). The nonRol 
phenotype was stably inherited. It was hypothesized that in those worms, the Mos 

30 transposase excised Mos 1 fiom the integrated array. The resulting DNA breaks were 
responsible for catastrophic excision of the entire locus, including interspersed rol-6 
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copies. The correlated loss of rol-6(sd) and Mos 1 was confirmed by PGR, in which 
NonRol individuals were cloned, selfed and DNA was purified firom the progeny. Mosl 
containing fragments were detected by PGR using one primer complementary to Mosl 
(OJL102: 5'- CAAGCTTGACTGTGGAACGACCATAG (SEQ ID NO: 1)) and one 
primer complementary to Z). mauritiana flanking DNA (oJL104: 5'- 
ACAAAGAGGGAACGCAGACGAGT (SEQ ID NO: 2)). Of 188 nonRol worms, only 
one individual retained a copy of the Mos 1 fragment that was initially present in the 
transgene. Based on the phenotypic reversion of the Rol phenotype, it was calculated that 
1 in 5 chromosomes experienced catastrophic excision of the transgene (20.9 % ± 1.1 %, 
mean ± SEM, n=44 plates) The probability p of a single chromosome containing the array 
of Mosl elements transgene experiencing "catastrophic excision" can be derived from a 
Punnett square where the ratio R of nonRol worms over the total number of the progeny: 
R = l/4p -h l/4p + (l/2p)^. These results demonstrated that the expression 
vector expressed the transposase in the germline and that the Mos 1 transposon in the 
chromosome was recognized as a substrate. 

To determiae if excision of Mos 1 from the array was associated with insertion in 
the genome, the progeny of animals expressing the transposase in the germline were 
screened for de novo insertions. Specifically, usmg PGR, the presence of the Mos 1 
element in the absence of the Drosophila sequences which flank the transposon in the 
array was assayed. Mosl presence was assessed by PGR using two primers located in the 
transposon (oJL102 (SEQ ID NO: 1) and oJLlOS: 5*- 

TCTGCGAGTTGTTTTTGCGTTTGAG (SEQ ED NO: 3)). The absence of Z). 
mauritiana flanking sequence was checked using oJL102 (SEQ ID NO: 1) and oJL104 
(SEQ ID NO; 2) as described above. In addition, a PGR positive control was performed 
on each DNA sample using ofigonucleotides located in the cha-1 gene. Because the 
integrated airay containing unraobilized transposons also contained rol-6(sd), insertions 
were sought in nonRol progeny; specifically, either nonRol animals that experienced 
catastrophic excision of the array or Dpy progeny (Figure 6B) were analyzed. Insertions 
were identified in 1% of nonRol progeny (2/227) and in 10 % of Dpy progeny (1 1/1 16 
F1+F2 Dpys). These results demonstrated that transposition of Mos 1 could be achieved 
in the C elegans germline. However, it was observed that high rates of excision were not 
accompanied by high rates of insertion; these results support previous data indicating that 
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these two processes are not coupled. H. G. van Luenen, S. D. Colloms, R. H. Plasterk, 
Cfe//79, 293 (1994). 

Using integrated arrays as a soiirce of transposons prevents the easy recovery of 
new insertions that occur on the same chromosome; this bias could be circimivented by 
using an extrachromosomal array of transposons. In addition, extrachromosomal arrays 
are not completely stable in raeiosis, which makes the isolation of strains lacking 
unmobilized transposons easy after mobilization. Therefore, it was tested whether ilfo^ 1 
could be mobilized jfrom an extrachromosomal array into the chromosomes. Specifically, 
the glh-2::Mos Transposase construct was used to mobilize transposons from b,Mos 1- 
bearing array (oxExl64[Mosl; rol-6(sd)]). The nonRol progeny firom double transgenic 
animals (oxExl67[glh-2::MosTransposase]; oxExl64[Mosl; rol-6(sd)]) were analyzed 
for transposition events using PGR. An insertion frequency of 1% (3 insertions/ 302 
progeny. Table 1) was detected. Thus, these results closely match those obtained for 
integrated arrays. 

Table 1. Frequencies of Mosl genomic insertions from an extrachromosomal array. 
nonRol progeny of oxExl64[Mosl; rol'6(sd)J; oxExfMosTransposaseJ were analyzed by 
PGR for the presence of Mosl and the loss of tho Drosophila flanking sequences present 
in the donor plasmid. Mosl presence was assessed by PGR using two primers located in 
the transposon (oJLl 02 (SEQ ID NO: 1) and oJL103 : 5'- 
TCTGCGAGTTGTTTTTGCGTTTGAG (SEQ ID NO: 3)). The absence ofD, 
mauritiana flanking sequence was checked using oJLld2 (SEQ ID NO: 1) and oJL104 
(SEQ ID NO: 2) as described above. In addition, a PGR positive control was performed 
on each DNA sample using oligonucleotides located in the cha-1 gene. When heat- 
shocked (1 hour at 35°C) POs were moved to fresh plates and eggs were collected for the 
next 24 hours. ^During experiment #5 the stabiUty of the oxExl64[Mosl; roU6(sd)] 
transgene reached 75 % while in previous experiments it was approximately 20 %. 



Transposase 
construct 


Transposition frequency 




no heat-shock 


with heat-shock 


gIh-2: :MosTransposase 


exp#l: 2/108 1.9% 
exp #2; 0/104 0 % 
exp#3:l/90 1.1% 


exp#l:ND 
exp #2: ND 
exp #3: 0/65 0% 
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hsp: :MosTransposase 


exp#l:ND 


exp #1:4/65 


6.2 % 




exp #2: 0/33 


exp #2: 3/98 


3.1 % 




exp#3:0/39 


exp #3: 6/87 


6.8 % 




exp#4:ND 


exp #4: 5/85 


5.9 % 




exp #5: 0/44* 


exp #5: 15/34* 


44.1 % 



Example 4 - Mos 1 - Mobilization in the Germline Using a Heat-Shock Promoter 

The glh--2 promoter expresses the transposase in the germline constitutively. 
Constitutive expression of the transposase has two disadvantages. First, crosses must be 
5 set up fresh every generation to guarantee that the array remains intact and does not 
accumulate inherited changes. Second, because the tranposase was expressed in the 
germline early in development, events identified in the progeny might not be independent 
but might have occiuxed when the germline was still comprised of only few cells. 
Expression limited to adults can be achieved by using a heat-shock promoter. Expression 

10 of the transposase could be induced after a strain containing the transposase and 

transposons had been propagated and expanded to many animals. In addition, heat- 
shocking animals with mature germlines would maxinoize the independence of insertion 
events. Animals expressing the transposase under the control of the heatshock promoter 
and bearing the integrated transposon (oxIs25/dpy-l 1; oxExl66[hsp::Mos Transposase]) 

15 were heat-shocked. POs could only be heat-shocked for 45 minutes; such animals were 
almost paralyzed, stopped eating and had low brood sizes. Longer heatshock caused the 
animals to die. It was speculated that this lethality is due to high rates of transposition in 
somatic cells. Ubiquitous expression of the transposase would cause double strand breaks 
in the chromosome at the site of integration in every cell which may cause cell cycle 

20 arrest or apoptosis. G. Evan, T. Littlewood, Science 281, 1317 (1998). 

Fl progeny were analyzed for catastrophic excision, that is, for the appearance of 
nonRol nonDpy progeny (Figure 6A). In contrast with results obtained using the glh-2 
expression vector, catastrophic excision was not observed. Only rare nonRol progeny 
were generated which were likely to be the result of recombination between the array and 

25 the dpy marker (Figure 6B). However, by analyzing the Dpy progeny of heat-shocked 
animals, it v^as discovered that the heatshock construct caused the efficient insertion of 
Mos into new locations in the genome. Approximately 27% (25/94) of Fl or F2 Dpy 
worms carried novel transposon insertions (Figure 68). In addition to novel insertions, 
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recombinant chromosomes containing both the dpy marker and the original array arose at 
high frequency. These were identified as Dpy animals containing transposons flanked by 
Drosophila DNA. This hotspot for recombination is likely to arise as a result of double 
strand breaks introduced into the array by the transposase. A. R, Lobe, C. Tinamons, I. 
Beerman, E. R. Lozovskaya, D. L. Hartl, Genetics 154, 647 (2000). 

It was then tested whether transposition could occur from an extrachromosomal 
array using the heatshock promoter construct to express the transposase (oxExl66; 
oxExl64[Mosl; rol-6(sd)]). Hermaphrodites bearing both arrays were heat-shocked as 
young adults. The nonRol progeny were analyzed by PGR for transposition events. New 
insertions were observed in 8.9% of the Fl (33/369 progeny. Table 1). Since transposition 
could have occurred into the transposase containing array, F2 animals that lost this array 
were isolated from eight transposon-bearing strains. Genomic DNA was prepared from 
these strains and analyzed for the presence of Mas L The transposon was still detected in 
all 8 strains, thus demonstrating that the transposon had not inserted in the array. No 
insertions could be detected in 1 16 Fl clones derived from non-heat-shocked parents. The 
frequency of transposition was low but one of the main limiting factors is the stabiUty of 
die extrachromosomal array that is used as a transposon source. The initial experiments 
were performed when the array was only 20% stable and transposition frequencies were 
in the range of 5%; when the array matured and was about 75 % stable, transposition 
frequency reached 44%. Generating more stable extrachromosomal arrays could increase 
the frequencies of transposition. 

The heatshock promoter was able to drive expression of Mos transposase in the 
germline and to promote transposition events at a higher rate than obtained usiug the 
glh'2 construct. Temperature has been shown to affect transposition frequency in other 
organisms. D. Garza, M. Medhora, A. Koga, D. L. Hartl, Gmetics 128, 303 (1991). One 
possible explanation for the efficient transposition observed after heatshock is that 
chromatin structiure is somehow altered by the heatshock. Therefore, it was tested whether 
heatshock itself could account for the difference in transposition frequencies. Parents with 
extrachromosomal arrays canying the g/A-2-transposase construct and carrying the 
transposon were heatshocked and progeny were tested for transposition. The frequency of 
transposition was not improved by the heatshock treatment (Table 1). Thus, heatshock 
itself does not facilitate efficient transposition. 
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Example 5 - Transposition in Somatic Cells 

To determine whether the Mosl element could be mobilized in C elegans ceDs, 
Mosl transposition in somatic cells was first analyzed. The gene encoding ih&Mosl 
transposase was engineered to improve expression in the worm and placed under the 
control of a heat-shock promoter. The Mos transposase encoding sequence was PCR 
amplified out of pBluescribe M13+/Afoj7 (M. Medhora, K. Maniyama, D. L. Hartl, 
Genetics 128, 31 1 (1991), modified as described in Figure 5A and subcloned as a Mlu I- 
Nhe I fragment between the hsp'16'48 promoter (H. G. van Luenen, S. D. CoUoms, R. H. 
Plasterk, Embo J, 12, 2513 (1993)); E. P. Candido, et al,, Genome 31, 690 (1989) and the 
glh'2 y untranslated region (fragment 35383 to 36190 in cosmid C55B7) (M. E. Gruidl, 
et al., Proc. NatL Acad. ScL USA 93, 13837 (1996) (Figure 5A). The resulting construct 
was used to generate the extrachromosomal array oxExl66fhsp::MosTransposaseJ, (hn- 
1 5(n765) hermaphrodites were injected in the syncitial gonad with a mixture of the 
following gel-purified firagments: a Hind IH-Eco RI fi:agment of hsp::Mos Transposase 
(injection concentration: 10 ng//zl), aPst I-Bsi WI fi-agment of the ofin'l::gfp construct 
(pPD97/98) that expresses GFP in the coelomocytes (gift of PiaU Sengupta) (injection 
concentration: 5 ng///l) and a Kpn I-Eag I fi-agment of EKL15(//n-7 J +) (S. G. Clark, X. 
Lu, H. R. Horvitz, Genetics 137, 987 (1994)) (injection concentration: 10 ng//^l), Plasmid 
backbones were removed firom all purified fragments, Eco RV-digested N2 worm 
genomic DNA was coinjected at a concentration of 70 ng/Ail, Another extrachromosomal 
array, oxExl64[Mosl; rol-6(sd)], contained the Mosl transposon. (Jin-15(n765) 
hermaphrodites were injected with a 2.2 kb Xho I-Hind III fragment of pBluescribe 
M13+/M?57 that contains the 1.3 kb Mosl element flanked by D, simulans sequences (M. 
Medhora, K. Maruyama, D. L. Hartl, Genetics 128, 311 (1991) (injection concentration: 
10 ng///l) and a 2.2 kb roU6(sd) fragment of pRF4 (J. M. Kramer, R. P. French, E. C. 
Park, J. J. Johnson, Mol Cell Biol 10, 2081 (1990) (injection concentration: 10 ng///l), 
Eco RV-digested N2 worm genomic DNA was coinjected at a concentration of 80 ng/^1 
to increase the complexity of the array; this array also contained the dominant genetic 
marker rol-6(sd) which causes animals to roll instead of swimming in a sinusoidal 
fashion. These two strains were crossed and progeny carrying both arrays were heat- 
shocked as yomg adults. After 12 hours, the heat-shocked animals were harvested and 
genomic DNA was prepared. Mosl transposition was detected using the strategy 
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developed by van Luenen et al See H. G. van Luenen, S. D. Colloms, R. H. Plasterk, 
Embo J. 12, 2513 (1993). Specifically, insertions v^ere identified by PGR amplification 
using one set of primers complementary to the transposon and another set compleraentaiy 
to an arbitrary target gene. DNA purification and PGR were performed as described in H. 
G. van Luenen, S. D. Colloms, R. H. Plasterk, Embo J. 12, 2513 (1993). The primers in 
5 Mosl were o JL88 (5 '-CGCATGCGGCTTACTCAC (SEQ ID NO: 4)) first PGR; and 
0JL89 (5'-GGCCCCATCCGATTACCACCTA (SEQ ED NO: 5)) second PGR. Primers 
in imc-49 were oJL19 (S'-GGGAAAGGCATAGCAACTGTA (SEQ ID NO: 6)) first 
PGR; and oJL20 (5'-TTGATGGGGAAAAGGAGGGGT (SEQ ID NO; 7)) second PGR, 
Primers in gpa-2 were the same as described in H. G. van Luenen, S. D. Golloms, R H. 

10 Plasterk, Embo J. 12, 2513 (1993). PGR products were gel-purified and sequenced using 
oJL89 (SEQ ID NO: 5) as a primer. A PGR product can be obtained only if a transposon 
has integrated into the target gene. The method is sensitive enough to detect a single 
insertion in the target gene in a single somatic cell of an adult animal. Insertions in two 
genes were assayed: the gpa-2 gene which encodes a G protein subunit (R. R. Zwaal, J. E. 

15 Mendel, P. W. Sternberg, R. H. Plasterk, Genetics 145, 715 (1997)), and the imc'49 gene 
which encodes a GAB A receptor. B. A. Bamber, A, A. Beg, R. E. Twyman, E. M. 
Jorgensenj Neiiroscl 19, 5348 (1999). Mosl insertions were detected in both genes (2.5 
il,0 inserts in 10 ng of genomic DNA, mean ± S JD., n=5 experiments). Given that the 
maximal distance of the inserts firom our gene primers was approximately Ikb, it was 

20 estimated that an average of 10 insertions occurred per cell in heat-shocked animals. 
Insertions were also detected at low fi-equency in worms that contained the transposon 
array but lacked the transposase expression constmct (0.09 insertions in 10 ng DNA, n=2 
experiments). These data indicated that low levels of Mosl tranposase were expressed 
from the intact Mosl transposons in the extrachromosomal array. 

25 To demonstrate that these transposon insertions represented bona fide 

transposition events, PGR products were gel-purified and the sequence of the insertion 
sites firom the somatic transposition assays was detemiined. In all cases, the Mosl 
inverted terminal repeats were complete, the Drosophila sequences that flanked Mosl in 
the donor plasmid were no longer present, and the insertions all took place at a TA 

30 dinucleotide. Transposon insertions were distributed uniformly in exons, introns and 3' 
noncoding sequences of gpa'2 and unc-49 (Figure 5B). Gomparison of 22 insertion sites 
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did not reveal a strong consensus site apart from a bias toward a T at position +1 with 
respect to the TA dinucleotide (Figure 5C). These data demonstrated that Mosl can hop 
into C. elegans chromosomes and that the transposase was sufficient to catalyze insertion 
without Drosophila host factors. 

Example 6 - Introduction of Mariner Transposon Copies into the C. eleeans Genome 
The fulHength copy of the homfly mariner transposon ^i/^ar was gel-purified to 
remove non-nematode plasmid sequences. Purified ^w/mar was injected with linearized 
C elegans genomic DNA and the j'ol-Sfdm) plasmid into lin-15(n765ts) worms and 
unstable transgenic strains were recovered. Due to the presence of rol-dfdm) ia the array, 
transgenic animals roll instead of displaying normal sinusoidal locomotory movements. 
These animals are Lin when grown at the nonpermissive temperature because they are 
genotypically lin-15(-). This array was integrated into a chromosomal location to generate 
the oxIs21 insertion. oxIs21 was mapped to chromosome X, 2.5 m.u. away from the lon-Z 
locus. 

Example 7 -The Mariner Transposase Can Excise Mariner Trans posons from C 
Elegans Chromosomes in the Germline 

Engineered Himarl was inserted in the glh-2 germline expression cassette 
described above. The glh'2::HimarJ construct was co-mjected with linearized C. elegaiis 
genomic DNA and the lin'15(+) plasmid into lin'15(n765ts) worms. The oxExJJS 
extrachromosomal array is transmitted at each generation to a large flection of flie 
progeny. 

Iin-15(n765ts); oxExll9[glh'2::Himarl;lin'15('\-)] males were crossed into 
lin-15(n765ts) oxIs21[Autniar; rol-6(sd)J hermaphrodites. As predicted, animals of the 
cross-progeny were Rol nonLin. At the next generation, it was expected that 1/3 of the 
Rol animals would be found to be homozygous for oxIs2L However, among 48 Rol 
nonLin cloned individuals, none segregated more than approximately 75% Rols, while 6 
of 15 Rol Lin hermaphrodites segregated 100% Rol progeny. After carefiil 
characterization of the progeny of parent animals exhibiting various phenotypes, it was 
concluded that oxExllS could elicit the reversion of the Rol phenotype. Presumably, the 
reversion is caused by excision oiihoAutmar transposons from the integrated array which 
in trnn leads to loss of the adjacent rol-6(dm) genes by imprecise repair of the locus. It 
was concluded that the mariner transposase can excise mariner transposons from C, 
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elegans chromosomes in the germline. 
Example 8 - Mobilization of a Heterologous Mariner Transposon id the C. ele^ans 

Genome 

Materials and Methods: 
Reagents 

Mosl containing strain: 

The transgenic strain EG1638 that contains Mosl has been generated by 
coinjection of lm'15(n765) worms with: 

- the 22 kB Xho l-Hind III fragment of pBIuescribe M 13+/Mosl, M. Medhora et 
al.. Genetics 128:311-318 (1991); (injection concentration: 10 ng/^l) 

- the 2.2 kB Hind HI rol~6 rescuing fragment containing the semi-dominant 
mutation rol-6(sulO06), J. M. Kramer et al.,Mo/ Cell Biol 10:2081-2089 (1990); 
(injection concentration: 10ng//zl) 

- EcoRV digested genomic DNA prepared from N2 worms (injection concen- 
tration: 80 ng//zl) 

The resulting strain lin45(ts); oxExl64[Mosl; roU6(sd)] exhibits aRol Muv 
phenotype when grown above 20°C. The Muv phenotype is not expressed when worms 
are grown at IS^'C. 
Mas 1 transposase expressing strain: 

As shown in Figure 2, the expression vector pJL44 (HSP::MosTase::gIli-2) 
contains the following elements: 

- a 377 bp Hspl6-48 heat-shock promoter fragment recovered by PGR from 
pRP176, H. G. van Luenen et al., £:MB0 / 12:2513-2520 (1993), usmg the oligos 
oJL21 5'-CGAAGCTTGCTGGACGGAAATAGTGG (SEQ ID NO: 19) and 
oJL22 5'-CGACGCGTTCTTGAAGTTTAGAGAAT (SEQ ID NO: 20), 

- a 1088 bp fragment containing the Mosl transposase coding sequence amplified 
by PGR from pBIuescribe M 13+/Mosl using oJL77 5'- 

GCACGCGTTATGTCGAGTTTCGTGCCGAATAAAG (SEQ ID NO: 21) and 
oJL78 5'- 

GCGCTAGCTATTCAAAGTATTTGCCGTCGCTCGCGACACATTTTTCCCA 
(SEQ ID NO; 22), An artificial intron (5*- 
GTAAGTTTAAACATATATACTAACTAACCCATGGATTA- 
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TTTAAATTTTCAG-3' (SEQ ID NO: 18) was inserted at position 264 with 
respect to the ATG. 

- a 300 bp fragmeat containing the glh'2 3TJTR (nt 3287 to 4087 wifli respect to 
glh'2 start codon) recovered form a 6.3 kb glh'2 genomic fragment subcloned in 
pBluescript KS (Stratagene) (M. E. Gruidl et al, Proc Natl Acad Sci USA 
93:13837-13842 (1996); gift of Karen Bennett, University of Missouri). 

The transgenic strain BG1643 that contains the Mosl transposase expression 
vector has been generated by coinjection of lin-15(n765) worms with: 

- the Hind m-EcoRl fragment of pJL44 (injection concentration: lOng/^1) 

- the Eag I-Kpn I Iin-15 genomic rescuing fragment from EKL15, S. L. McZntire 
et al.. Nature 3S9:S70'S76 (1997) 

- the Pst I-BsiW I fragment of pPD97/98 that drives expression of the Green 
Fluorescent Protein in the coelomocytes (gift of Piali Sengupta, Brandeis 
University) (injection concentration: 10 ng/fA) 

' EcoRV digested genomic DNA prepared from N2 worms (injection 
concentration: 80 ng//zl) 

The resulting strain lin-lSfts); oxExl66Ptsp::MosTase:, glh'2; pPD97I98; lin- 
15(+)] has a wild-type phenotype. The presence of the extrachromosomal array causes 
expression of GFP in the coelomocytes which can be visualized using fluorescence 
microscopy. 

Mobilization of the transposon in the C elegans genome 

Mobilization of Mosl was achieved by crossing the transposase-expressing strain 
into worms containing the Mos 1 transposon-contaming array. Iin45(ts)\ oxExl66Pisp:: 
MosTase:.'glh-2; pPD97198; lin-15(+)J hennaphrodites were crossed with N2 males at 
25 °C. Non-Muv males lin-15(ts): oxExl66wtTe crossed with lin-lSfts); oxExl64[Mosl; 
roU6(sd)] Rol non-Muv hermaphrodites previously grown at 15°C, 

The cross was kept at 20 °C. Late L4 larvae or young adult Rol worms were 
transfeired to a fresh plate and heat-shocked for 1 hour at 35 °C. After 6 hours, non-Muv 
Rol PO animals (lin45(ts); oxExl64; oxExl66) were transferred to a fresh plate and 
allowed to lay eggs for 48 hours. A fraction of the Fl animals contain insertions of Mos 1 
in their genome and can be screened for mutant phenotypes. 
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Identification of transposon insertion sites 

Mosl insertion were identified by inverse PGR, as shown in Figure 3. Genomic 
DNA was prepared according to standard procedure. Approximately 100 ng of genomic 
DNA was digested by Sau3A in a 10 /zl volume for 3 to 14 hours. The restriction enzyme 
was inactivated by heating for 20 minutes at 70 ''C. Fragments were circularized by self- 
ligation (overnight incubation at 15 °C with 5 xmits of T4 DNA ligase). 

3 lA of ligated DNA was used for PGR amplification. A first round of 
amplification was performed using the primers oJLlOS 5*- 
TCTGCGAGTTGTTTTTGCGTTTGAG (SEQ ID NO: 3) and oJL114 5'- 
AAAGATTCAGAAGGTCGGTAGATGGG (SEQ ED NO: 10) (30 cycles, 45" at 94^C / 
1' at 60°C / ri5" at 72*'C, magnesium chloride concentration: 1.5 mM). The product of 
the first amplification was diluted 100-fold and subjected to a second round of 
amplification using the nested primers oJLl 15 5'-GCTCAATTCGCGCCAAACTATG 
(SEQ ID NO: 11) and 0JLII6 S^GAACGAGAGGCAGATGGAGAGG (SEQ ED NO: 
12) (25 cycles, 45" at 94"C / V at 62°C / I'lS" at 72°C, magnesium chloride 
concentration: 2.5 mM), Resulting firagnients were run on an agarose gel, gel-purified and 
sequenced either directly or after subcloning. 

Figure 4 contains the sequence of an inverse PGR product demonstrating insertion 
of Mosl in chromosome X. Nucleotides in capital letters are fi-om the Mosl transposon. 
C. elegans flanking genomic region is in lower case. It matches the Y47C4.Contig215 
sequence from chromosome X available at the Sanger Centre. 

Mosl, a mari?ier-]ikG transposon isolated firom Drosophila mauritiana was used. 
Transgenic worms containing Mosl in an extrachromosomal array were crossed with 
transgenic worms containing an expression vector in which a heat-shock promoter (Jisp 
16-48) drives the expression of the mos transposase (Figure 1), Cross-progeny containing 
both the Mosl transposon and the mos transposase were isolated. Heat-shock of these 
worms induced the expression of the transposase which in turn caused Mosl elements to 
transpose firom the extrachromosomal array into the C. elegans genome. Five insertions 
were isolated, for a rate of one in seventeen animals analyzed. However, this array is only 
20% stable per generation. Thus, there in on average one transposition into chromosomes 
for every three gerai cells exposed to the transposon. 

Some insertions will disrupt genes and cause mutant phenotypes. Mutant worms 
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are outcrossed with wild-type worms containing no Mosl transposon. Since the insertion 
responsible for the mutation cosegregates with the mutant phenotype, it is possible to 
isolate the single relevant Mosl insertion after only a few outcrosses. Genomic DNA is 
then prepared jfrom the outcrossed mutant. Regions flanking the transposon are recovered 
by inverse PGR and sequenced. Comparison of flanking sequences with the C elegans 
genome sequence allows immediate identification of the mutated gene. This new 
mutagenesis system will significantly speed up the identification of genes of interest 
using C elegans as a genetic model 

Example 9 - Mosl Mutagenesis and Rapid Cloning of Genes 

In one embodiment, the method described in this invention is capable of 
generating mutations which can be rapidly cloned based on the Mosl unique DNA tag. 
To demonstrate that this is true, mutants have been identified and the relevant genes have 
been cloned using inverse PGR Specifically, a morphological mutant in C. elegans was 
isolated which causes the worms to be short and squat. Such mutations are called dumpy 
mutations and are given the three letter designation "dpy". A dumpy animal was 
identified after mobiUzation of the wild-type Mosl transposon. DNA was prepared, 
cleaved with the restriction enzyme Sau3A, and religated. Inverse PGR was performed 
using primers contained within the transposon but facing outward. The amplified 
firagment was sequenced. Th^Mosl element was inserted 175 nucleotides 5' of F54D8.1, 
which encodes a collagen protein. An inspection of the genetic map demonstrated that 
this insertion is in a chromosomal interval which also contains the dpy-l 7 gene which had 
been previously defined by point mutations using chemical mutagens by Sydney Brenner 
in 1974. A complementation test was performed and the test demonstrated that this 
mutation was an allele of dpy-l?. Thus, the mefhod is capable of rapidly demonstrating 
the molecular identity of a gene which had remained unknown for almost 30 years. 
Mutants incapable of detecting high osmotic gradients (Osm) were also screened for. The 
first Osm mutant identified was cloned in a similar manner and proved to be an insertion 
of Mosl in exon 10 of the eat'4 gene. 

Example 10 - Targets of Transposase and Transposon 

For Mosl insertions to be usefiil for the cloning of mutated genes, the transposase 
must specifically mobilize Mosl and not other mariner elements. The C elegans genome 
contains endogenous transposons. Apart firom the most active Tel and Tc3 transposons, 
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which are distantly related to Mosl, every haploid genome contains at least 55 copies of a 
Mariner Like Element (MLE), which is closely related to Mosl. M. M. Sedensky, S. J. 
Hudson, B. Everson, P. G. Morgan, Nucleic Acids Res, 22, 1719 (1994); H. M. 
Robertson, D. J, Lampe, Mol. Biol. Evol 12, 850 (1995). Since in a few cases 
transposases of the Mariner family have been shown to cross-mobilize distinct but related 
transposons (P. Sundararajan, P. W. Atkinson, D, A. O'Brochta, bisect Mol Biol 8, 359 
(1999)), it was tested whether Mos transposase expression had triggered transposition of 
the endogenous MLEs. Eight strains in vAAchMosl insertions had occurred were 
analyzed by Southem blot for changes in MLE distribution. No changes in MLE 
distribution were detected. Worm genomic DNA oilin-15(n765) and Afo^i-containing 
strains was extracted, Bgl U digested and run for Southem blot analysis using standard 
procedures. Oligos oJL132: 5'-ATATGCGGTGCGATGGGTGAG (SEQ ID NO: 8) and 
OJL133: 5^GGCGAACGCGATGAGAAGAAAG (SEQ ID NO: 9) were used to amplify 
a 842 bp MLE fragment from N2 worm genomic DNA. The PGR product was sequenced 
and used for probe synthesis, (data not shown) indicating that Mos transposase is specific 
for Mosl in the C elegans germline. 

How many insertions occurred in every animal and what were their distributions? 
The number of chromosomal insertions per strain was determined by Southem blot 
analysis in eight insertion strains. Only one insertion per strain was detected (Figure 7A). 
To detennine the location of the mobilized transposons, the left junctions of 17 insertions 
were cloned using inverse PGR. Approximately 100 ng of total genomic DNA was 
digested with Sau3A, self-hgated under dilute conditions, and then 3 % of the ligation 
was subjected to two rounds of nested PGR using the following primers: oJLl03 (SEQ ID 
NO: 3)/oJLl 14 5'-AAAGATTCAGAAGGTCGGTAGATGGG (SEQ ID NO: 10) (first 
PGR), oJLl 15 5'-GCTCAATTCGCGCCAAACTATG (SEQ ID NO: 1 1) / oJLl 16 5'- 
GAACGAGAGGCAGATGGAGAGG (SEQ ID NO: 12) (second PGR). PGR products 
were purified on agarose gel and sequenced using oJLl 15 (SEQ ID NO: 1 1) as a primer. 
In agreement with the Southem blot experiments, only one insertion per strain was 
detected. Insertion sites were distributed on all six chromosomes (Figure 7B), 
Transposition occurred into exons, introns and intergenic regions (Table 2). Sequences 
flanking both sides of the transposon were determined for nine of the localized insertions. 
In each case, the inverted temainal repeats were complete and flanked by a TA 
dinucleotide that arose from the duplication of the original TA found in the genomic 
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Table 2. Properties of Mosl genomic insertions. Mosl flanking were compared with the 
C elegaiis genome sequence. Physical location in the genome is given as the nucleotide 
position of the corresponding clone in the C elegaiis database (ACeDB). 



Isolation 
name 


Transposon source 


Physical location 


Meipolated 
genetic location 


Genefinder 
nredictions 


oxTil 


Extrachromosomal 


Y65B4BL®27,362 


LGI, - 19 


Intergenic 


oxTi2 








Intergenic 


oxTi3 


Extrachromosomal 


M01E5 @19,740 


LGI, + 29.9 


Intergenic 


VIA X 1*T 


HAtracnromosomai 


1 liK^Z {Q 4,y4o 


LGn, +0,1 


Exon #4 of 
F41G.12 


oxTi5 


Extrachromosomal 


K08E5@ 31,631 


LGin, + 4.61 


Intergenic 


oxTi6 


Extrachromosomal 


H23L24@ 4,529 


LGIV, +3.9 


Intergenic 


OX 11 / 


Extrachromosomal 


K08D8 @ 4234 


LGIV, + 6.6 


Intergenic 


oxTiS 


Extrachromosomal 


R09B5@ 22,929 


LGV, -19.0 


Exon #6 of 
R09B5.12 


oxTi9 


Extrachromosomal 


Y69H2 @ 39,771 


LGV, +17.49 


Intron #5 of 
Y69H2.4 


oxTilO 


Extrachromosomal 


Y47C4A 


LGX, .20 


Repeat 


oxTill 


Extrachromosomal 


C34E11 @ 12,022 


LGX, +6.55 


Exon #10 of 
C34E11.1 


oxTil2 


Integrated array 


Y71A12B 0.50,370 


LGI. +21 


Intergenic 


oxTil3 


Integrated array 


Y48G1C (2} 19,916 


LGI, . 19.8 




oxTil4 


Integrated array 


C17F4 @ 22,793 


LGn, -8.06 


Exon #18 of 


oxTilS 


Integrated array 


F35C5 @ 5,735 


LGn, +10 


Intergenic 


oxTil6 


Integrated array 


C06B3 @ 14,747 


LGV, +5.79 


Intergenic 


oxTil7 


Integrated array 


R01H2(S). 20.193 


LGin, " 0.86 


Intergenic 



Comparison of the insertion site sequences did not reveal a strong consensus motif 
for the target DNA. Molecular analysis of the insertions therefore demonstrated that Mosl 
insertion obeyed properties previously observed for mariner class transposons. However, 
a formal possibihty remained that Mosl hopped into the genomic DNA present in one of 
the extrachromosomal arrays and that recombination occurred subsequently between the 
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aixay and the genome. To rule out this possibility, the insertion oxTi4 which was 
positioned 35 kb away from snt-1 was genetically mapped In agreement with this 
physical location, oxTi4 was mapped less than 2.5 map units from snt-l\ 20 Snt-1 
individuals were cloned from the self-progeny ofoxTi4lsnt'l hermaphrodites. None of the 
mutants segregated oxTi4, The presence of oxTi4 was determined by PGR using one 
Mosl primer pointing towards the right end of the trausposon (oJL89 (SEQ ID NO: 5)) 
and one primer in the genome (oJL129 5*-CCAAATGCGTCTGTCCCACTC (SEQ ID 
NO: 13)). A PGR positive control was performed on each DNA sample using cha-l 
primers. 

Example 11 - Remobilization of a Genomic Transposon Insertion 

The transposition events documented above were all excisions from an array of 
transposons residing in Drosophila DNA. To determine whether the transposase acts on a 
single Mosl transposon in a C. elegans chromosome, the oxTi4 insert was remobilized. 
Primers for PGR were designed flanking the oxTi4 insertion. A first PGR roimd was 
performed with primers located 1671 nt upstream and 3144 bp downstream to oxTi4 
(respectively oJL149 5'- AAGTATGGGGAAAGGACGGGAGAG (SEQ ID NO: 14) and 
OJL150 5'. GGATTGGGAGGTTTGTGGGTTCT (SEQ ID NO: 15)). A second round was 
performed using primers 493 bp upstream and 913 downstream to oxTi4 (respectively 
OJL145 5*- AGAGGGAGCATTTTGTAGTGT (SEQ ID NO: 16) and oJL148 5»- 
AGGGTGGGTGGTAAGTTGGTAGAG (SEQ ID NO: 17)). Short PGR products were gel 
purified, subcloned and sequenced. The transposase-expressing transgene 
{oxExl67[glh-2:Transposase]) was crossed into animals homozygous for the oxTi4 
insertion and DNAs from the progeny were analyzed for amplified fragments shorter than 
the insertion. These shorter PGR products represented a variety of excision events, 
including the three nucleotide excision footprint previously characterized for Mosl 
excisions (G. Bryan, D. Garza, D. Hartl, Genetics 125, 103 (1990)), as well as smaller 
footprints, excisions and even incomplete excisions (Table 3). Since these products could 
arise from excision events in somatic cells, progeny animals that lost the transposase 
expression array were analyzed. Pools of 15 individuals from oxTi4: 
oxExl67[glh-2:Traiisposase] progeny that lost the transposase array were transferred to 
fresh plates and allowed to lay eggs for 24 hours. Adult worms were then analyzed by a 
single round of PGR using the primers oJL145 (SEQ ID NO: 16)-oJL148 (SEQ ID NO: 
17). Sixty individuals were cloned from the progeny of the pool exhibiting short PGR 
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product and analyzed at the next generation to identify clones that lost oxTi4. One animal 
was identified among 954 progeny in which excision of the transposon had occmxed. In 
this animal the excision left a 3 bp footprint and the duplicated TA dinucleotide which 
together resulted in a +2 frameshift . These data indicate that single copies of the Mos] 
Drosophila transposon can excise from C elegans DNA in the germline to introduce 
5 frameshift or deletion mutations at the transposon insertion site. 

Table 3. Lesions generated by excision of the oxTi4 insert. The extrachromosomal 
[glh-2:Transposase] transgene was crossed into animals homozygous for the oxTi4 
insertion. PGR was used to analyze the oxTi4 insertion site after. the loss of Mos L Pools 

10 of 15 individuals from oxTi4; oxExl67[glh'2:Transposase] progeny that lost the 

transposase array were transfeired to fresh plates and allowed to lay eggs for 24 hours. 
Adult worms were then analyzed by a single round of PGR using the primers oJL145 
(SEQ ID NO: 16)-oJL148 (SEQ ID NO: 17). Sixty individuals were cloned from the 
progeny of the pool exhibitmg short PGR product and analyze at the next generation to 

15 identify clones that lost oxTi4. Top line: sequence of oxTi4. Lower case: Mos J sequence. 
Upper case: genomic sequence. Bold: TA dinucleotide duplicated during Mosl insertion. 
Bottom lines: excision products. Dash: deleted base pairs. The insertion (bottom line, 
italic letters) corresponds to an internal fragment of Mosl (nt 147 to 178). 

20 

CTCTTTTCCAGACGAGTAccaggtgtac tacacctgaXATATCGTTTTGTTCGTT 

CTCTTTTCCAGACGAGTA TATATCCTTTTGTTCCTT 

CTCTTTTCCAGACGAGTA- aTATATCCTTTTGTTCCTT 

25 CTCTTTTCCAGACGAGTA- tgaTATATCCTTTTGTTCCTT 

CTCTTTTCCAGACGAGTAc -TATATCCTTTTGTTCCTT 

249 bp deletion— -tgaTATATCCTTTTGTTCCTT 

CTCTTTTCCAGACGAGa- 143 bp deletion- 

CTCTTTTCCAGACGAGTA 188 bp deletion 

30 463 bp deletion- 

CTCTTTTCCAGACGAGTA attfitttactctcagtgcagtcaacatgtcgaTATCCTTTTGTTCCTT 
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Example 12: - Engineering Mutations in the C. elegans Genome bv Transgene 
Instructed DNA Double Strand Break Repair Following Mosl Excision 
Germline expression of the Mos transposase under the control of the glh-2 
promoter causes reexcision of single copies of Mosl inserted in the C elegans genome. 
Remobilization of the transposon causes a DNA double strand break (DSB) at the site of 
5 excision which is repaired by the cellular machinery. In 1992, R. Plasterk and J, Groenen 
(EMBO J, II :287) demonstrated that a DSB caused by excision of a Tel transposon in a 
mut'6(st702) background can be repaired using DNA contained in an extrachromosomal 
array that carries sequences homologous to the region of excision. As a result, sequences 
flanking the break can be replaced by sequences contained in the transgene. This strategy 

1 0 provides a way to engineer mutations in the genome. However, this approach never 

became a routine strategy probably because transposition is not controlled and excision 
occmrs at low rates in such mutant strains. 

The controlled transposition of Mosl provides an efficient tool to use this strategy 
for engineering of the C elegans genome: after a Mosl insertion has been identified in 

15 the gene of interest, a transgene is constmcted with mutated sequences homologous to the 
region of insertion. The transgene that carries the glh-2: :MosTransposase expression 
vector is crossed into the starin that contain the Mosl genomic insertion and the template 
transgene. Expression of Mos transposase causes Mosl excision and the progeny is 
screened by PGR for transgene instructed repair at the excision site (Figure 8). 

20 The feasibility of regulated mobilization of a heterologous transposon in the C 

elegans germline was thus demonstrated. The characteristics of Mosl transposition 
suggest that it could be used as a technique for tagging mutant genes. First, the Mos 
transposase does not activate transposition of endogenous transposons. Second, 
transposition of Mos in the germline is strictly dependent on the expression of the 

25 transposase. In this respect the use of a heat-shock promoter to express the transposase is 
of particular interest since it provides a convenient way to tum transposition on and off 
and to stabiUze new inserts. Third, insertion sites of Mosl in the genome do not exhibit 
strong sequence bias. Transposons were inserted into exons, introns and intergenic 
regions. Comparison of the insertion sites did not reveal a strong consensus sequence 

30 apart from the TA dinucleotide. Fourth, excision and insertion frequencies can be 

differentially manipulated by expressing the transposase xmder the control of different 
promoters. The heatshock promoter caused very low rates of excision and loss of the 
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transposon array but high rates of transposon insertion. The glh'2 promoter construct 
caused a low rate of insertion but a high rate of excision and loss of the transposon array. 
Since transposon insertions frequently do not disrupt gene function in C elegans even if 
the insertion occurs in an exon (A. M. Rushforth, B. Saari, P. Anderson, Mol Cell Biol 
13, 902 (1993); A. M. Rushforth, P. Anderson, Mol Cell Biol 16, 422 (1996)), 
transposons are usually remobihzed to generate deletion alleles (D. Eide, P. Anderson, 
Mol Cell Biol 8, 737 (1988); R. R. Zwaal, A. Brocks, J. van Meurs, J. T, Groenen, R, H. 
Plasterk, Proc, Natl Acad, ScL USA90, 743 1 (1993)), It was thus demonstrated that the 
glh-2 expression construct can be used to generate deletion alleles of the genes containing 
Mosl insertions. 

MosI transposition in C. elegans wiU allow the development of two new genetic 
tools. First, mutations identified in forward screens using MosJ will allow the rapid 
cloning of the mutated gene. Second, a Ubraiy of insertions localized in the genome could 
be generated; the glh-2 expression construct could then be used to remobiUze these 
insertions at high frequency and generate deletion and frameshift mutations in genes of 
interest. 
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CLAIMS: 

1 . A method of regulated expression of a heterologous gene in cells of the gennline 
of C. elegam comprising the steps of: 

a. inserting a transgene construct into the C. elegans, wherein the construct 
comprises the heterologous gene operably linked to a promoter and a 3* 

5 xmtranslated region of a gene that is expressed in the C. elegans germline, 

wherein the promoter is an inducible promoter or a germline-specific 
promoter; and 

b. expressing the heterologous gene. 

2. The method of Claim 1, wherein the promoter is inducible. 

10 3 . The method of Claim 2, wherein the promoter comprises a heat-shock promoter. 

4. The method of Claim 2, wherein the promoter comprises a tetracycline-regulated 
promoter, 

5. The method of Claim 1, wherein the construct is substantially free of bacterial 
plasmid sequences. 

15 6. The method of Claim 1, wherein the construct is substantially free of repeated 
DNA sequences. 

7. The method of Claim 1, wherein the 3' untranslated region comprises ag/A-2 3' 
untranslated region. 

8. The method of Claim 1, wherein the promoter comprises a glh-2 promoter, 
20 9. A transgene construct for expression in C. elegans comprising a heterologous 

gene operably linked to a promoter and a 3 ' untranslated region of a gene that is 
expressed in the C elegans germline, wherein the promoter is an inducible 
promoter or a germline-specific promoter. 
10. The transgene construct of Claim 9, wherein the promoter is inducible. 
25 11. The transgene construct of Claim 10, wherein the promoter comprises a heat- 
shock promoter. 

12. The transgene construct of Claim 10, wherein the promoter comprises a 
tetracycline-regulated promoter. 

13. The transgene construct of Claim 9, wherein the construct is substantially free of 
30 bacterial plasmid sequences. 

14. The transgene construct of Claim 9, wherein the construct is substantially free of 
repeated DNA sequences. 
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15. The transgene construct of Claim 9, wherein the 3' untranslated region comprises 
a glh'2 3' untranslated region. 

1 6. The transgene construct of Claim 15, wherein the promoter conaprises a heat- 
shock promoter, 

17. The transgene construct of Claim 9, wherein the promoter comprises SLglh'2 
5 promoter. 

18. The transgene construct of Claim 9, wherein the heterologous gene is a 
transposase. 

1 9. The transgene construct of Claim 9, wherein the heterologous gene is a TC3 A 
transposase gene. 

10 20. A method of transposon-mediated mutagenesis in a C. elegans genome, 
comprising the steps of: 

a. introducing a transgene construct into the C. elegans genome, wherein the 
construct comprises a transposase gene which is operably linked to a 
regulable expression control element and a 3' untranslated region of a gene 

15 that is expressed in the C. elegans gennline; and 

b. expressing the transposase gene, such that a transposon in the C. elegaiis 
genome transposes, causing a mutation. 

2 1 . The method of Claim 20, wherein the transposons comprise endogenous 
transposons. 

20 22. The method of Claim 2 1 , wherein the transposons comprise Tc3 transposons. 

23. The method of Claim 20, wherein the transposase gene is a TC3 A transposase 
gene. 

24. The method of Claim 22, wherein the transposase gene is a TC3 A transposase 
gene. 

25 25. The method of Claim 2 1 , wherein the regulable expression control element is an 
inducible promoter. 

26. The method of Claim 26, wherein the promoter comprises a heat-shock promoter. 

27. The method of Claim 25, wherein the promoter comprises a tetracycline-regulated 
promoter. 

30 28. The method of Claim 20, wherein the construct is substantially foe of bacterial 
plasmid DNA sequences. 
29, The method of Claim 20, wherein the construct is substantially foe of repeated 
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DNA sequences. 



30. The method of Claim 20, wherein the 3' untranslated region comprises eiglh~2 3' 
xmtranslated region. 

3 1 . The method of Claim 30, wherein the regulable expression control element 
5 comprises a heat-shock promoter, 

32. The method of Claim 30, wherein the regulable expression control element 
comprises a glh-2 promoter. 

33 . The method of Claim 20, further comprising introduction of one or more 
additional copies of an endogenous transposon into the C elegmis germline. 

10 34. The method of Claim 33, wherein the endogenous transposon is a Tc3 transposon. 

35. The method of Claim 20, wherein the transposons comprise heterologous 
transposons. 

36. The method of Claim 35, wherein the heterologous transposons are introduced 
into the C. elegans genome: 

1 5 37. The method of Claim 35, wherein the transposons comprise Mos 1 transposons, 

38. The method of Claim 35, wherein the transposase gene comprises restriction sites 
5' of the start codon, restriction sites 5' of the stop codon, and an artificial intron 
in the transposase gene open reading frame. 

39. The method of Claim 35, wherein the regulable expression control element is an 
20 inducible promoter. 

40. The method of Claim 39, wherein the promoter comprises a heat-shock promoter, 

41 . The method of Claim 39, wherein the promoter comprises a tetracycline-regulated 
promoter. 

42. The method of Claim 35, wherein the construct is substantially free of bacterial 
25 plasmid DNA sequences. 

43. The method of Claim 35, wherein the construct is substantially free of repeated 
DNA sequences. 

44. The method of Claim 35, wherein the 3 ' \mtranslated region comprises a glh'2 3 ' 
untranslated region. 

30 45. The method of Claim 44, wherein the regulable expression control element 
comprises a heat-shock promoter. 
46. The method of Claim 44, wherein the regulable expression control element 
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comprises a glh-2 promoter, 

47. A method of introducing a heterologous DNA sequence into a C. elegans 
chromosome comprising the steps of: 

a. introducing a transposon into the C elegans^ wherein the transposon 
comprises the heterologous DNA sequence; 

b. introducing a transgene construct into the C elegans, wherein the 
construct comprises a transposase gene which is operably linked to a 
promoter and a 3' untranslated region of a gene that is expressed in the C. 
elegans gennhne; and 

c. expressing the transposase, such that the transposase integrates as a single 
copy into a C. elegans chromosome, 

48. The method of Claim 47, wherein the heterologous DNA sequence comprises a 
bacterial plasmid DNA sequence. 

49. The method of Claim 47, wherein the gene carried on the transposon is useful for 
selection or screening pmposes. 

50. The method of Claim 47, wherein the transposon contains FRT/FLP 
recombination sites. 

5 1 . The method of Claim 47, wherein the promoter is inducible. 

52. The method of Claim 47, wherein the promoter comprises a heat-shock promoter. 

53 . The method of Claim 47, wherein all bacterial plasmid sequences have been 
removed from the construct. 

54. The method of Claim 47, wherein the construct is substantially free of repeated 
DNA sequences, 

55. The method of Claim 47, wherein the 3' untranslated region comprises eiglh-2 3* 
untranslated region. 

56. The method of Claim 47, wherein the promoter comprises a glh'2 promoter. 
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